Rejoinder: Struggles with Survey Weighting and Regression Modeling
نویسنده
چکیده
Encountering this sort of statement in the documentation of opinion poll data we were analyzing in political science: “A weight is assigned to each sample record, and MUST be used for all tabulations.” (This particular version was in the codebook for the 1988 CBS News/New York Times Poll; as you can see, this is a problem that has been bugging me for a long time.) Computing weighted averages is fine, but weighted regression is a little more tricky—I do not really know what a weighted logistic regression likelihood, for example, is supposed to represent. Constructing the weighting for the New York City Social Indicators Survey (SIS). It quickly became clear that we had to make many arbitrary choices about inclusion and smoothing of weighting variables, and we could not find any good general guidelines. We wanted to estimate state-level public opinion from national polls. If our surveys were simple random samples, this would be basic Bayes hierarchical modeling (with 50 groups, possibly linked using state-level predictors). Actually, though, the surveys suffer differential nonresponse (lower response by men, younger people, ethnic minorities, etc.) as signaled to the user (such as myself) via a vector of weights.
منابع مشابه
Comment: Struggles with Survey Weighting and Regression Modeling
Andrew Gelman’s article “Struggles with survey weighting and regression modeling” addresses the question of what approach analysts should use to produce estimates (and associated estimates of variability) based on sample survey data. Gelman starts by asserting that survey weighting is a “mess.” While we agree that incorporation of the survey design for regression remains challenging, with impor...
متن کاملStruggles with Survey Weighting and Regression Modeling
The general principles of Bayesian data analysis imply that models for survey responses should be constructed conditional on all variables that affect the probability of inclusion and nonresponse, which are also the variables used in survey weighting and clustering. However, such models can quickly become very complicated, with potentially thousands of post-stratification cells. It is then a ch...
متن کاملComment: Struggles with Survey Weighting and Regression Modeling
I appreciate the opportunity to comment on Andrew Gelman’s interesting paper. As an admirer of Gelman’s work, it is a pleasure to read his take on the topic of survey weighting, which I have always found fascinating. Since I support Gelman’s general approach, I focus on reinforcing some points in the article and commenting on some of the modeling issues he raises. As a student of statistics, I ...
متن کاملComment: Struggles with Survey Weighting and Regression Modeling
In the ideal samples of survey sampling textbooks, weights are the inverses of the inclusion probabilities for the units. But nonresponse and undercoverage occur, and survey statisticians try to compensate for the resulting bias by adjusting the sampling weights. There has been much debate about when and whether weights should be used in analyses, and how they should be constructed. Professor G...
متن کاملComment: Struggles with Survey Weighting and Regression Modeling
This is an intriguing paper that raises important questions, and I feel privileged for being invited to discuss it. The paper deals with a very basic problem of sample surveys: how to weight the survey data in order to estimate finite population quantities of interest like means, differences of means or regression coefficients. The paper focuses for the most part on the common estimator of a po...
متن کامل